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ABSTR,\CT 

Statistical  Decision  Theor}’  is  applied  to  assessing  objectively  the  uncertainty 
involved  in  foreign  policy  decisions.  In  an  international  conflict  of  interest  situation,  a 
protagonist  is  presented  with  the  problem  of  estimating  what  course  of  action  an 
antagonist  is  going  to  pursue.  This  problem  is  addressed  by  taking  observations  of  the 
antagonist's  behavior.  These  observations  are  interpreted  as  being  associated  with  a 
specific  course  of  action.  Bayes  formula  is  then  used  to  update  a  conjectured  a  priori 
probability  function  to  estimate  the  course  of  action  being  pursued  by  the  antagonist. 
This  updated  {a  posteriori)  conditional  probability  function  is  then  used  to  develop  a 
decision  rule  to  select  an  appropriate  response.  The  decision  rule  is  based  on  an 
ordering  of  possible  outcom.es,  the  values  assigned  to  those  outcomes,  and  greatest 
expected  value  of  return.  '  'Yk  --  ’ 
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I.  INTRODUCTION 

This  thesis  models  foreign  policy  decisions  using  a  2  x  2  decision  under  risk 
matrix  with  interval  scaled  payoffs.  The  general  application  concerns  two  nations  with 
a  conflict  of  interest  situation  existing  between  them.  Conflict  of  interest  situations  will 
continue  to  arise  as  a  result  of  the  ideological  differences  that  exist  in  the  world  today. 
These  are  part  of  the  overall  East  vs.  West  protracted  conflict  as  characterized  by 
Strause-Hupe,  Kintner,  Dougherty  and  Cottrell  in  their  book  Protracted  Conflict.  "The 
United  States  in  this  atmosphere  will  be  repeatedly  confronted  with  the  enigma  of 
making  a  decision  in  the  face  of  uncertainty."  [Ref  1|  Foreign  policy  decisions  involve 
deciding  what  course  of  action  is  the  most  appropriate,  given  the  evidence  at  hand.  The 
courses  of  action  available  could  be  considered  either  peaceful  or  aggressive  actions. 
The  evidence  at  hand  could  be  the  behaviors  the  antagonist  is  presenting  exhibiting.  Are 
his  actions  peaceful  in  the  present  situation  or  are  they  aggressive? 

A  conflict  of  interest  situation  involving  the  United  States  and  the  Soviet  Union 
could  be  considered  as  a  simple  2x2  decision  under  risk  matrix,  where  the  United 
States,  considered  as  the  decision  maker,  would  choose  one  of  two  alternatives 
(peaceful  or  aggressive  action).  The  Soviet  Union  could  be  considered  to  have  these 
same  two  courses  of  action  open  to  it.  For  the  purposes  of  this  thesis,  all  actions  arc 
considered  to  be  one  of  these  two  alternatives. 

A.  CATEGORIZING  PEACEFUL  OR  AGGRESSIVE  ACTIONS 

Grouping  the  alternative  courses  of  action  into  the  appropriate  category'  of 
peaceful  or  aggressive  should  be  a  fairly  easy  task.  For  example,  during  the  Berlin 
crisis,  troop  and  combat  aircraft  buildups  in  the  vicinity  of  Berlin  by  the  Soviet  Union 
might  be  interpreted  by  the  United  States  as  the  U.S.S.R.  pursuing  a  hot-war 
(aggressive)  course  of  action.  On  the  other  hand,  the  U.S.S.R.'s  permitting  .■Xmcrican 
aircraft  the  use  of  prescribed  air  corridors  while  in-llight  to  Berlin  might  be  interpreted 
to  m.ean  that  the  U.S.S.R.  is  pursuing  a  cold-war  (peaceful)  course  of  action. 

Mistorical  examples  of  exercising  a  hot-war  course  of  action  by  the  United  States 
might  include: 

•  the  landing  and  liberation  of  Grenada  by  U.S.  .Armed  Forces. 

•  the  sending  of  Marines  into  Lebanon. 

•  and  the  Libvan  bombinss. 
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Examples  of  Cold-war  alternatives  might  include: 

•  the  banning  of  trade  with  China  after  the  Communist  takeover, 

•  the  SALT  Talks, 

•  and  any  passive  act  including  the  withdrawal  of  military  forces  or  rrulitar>'  aid. 

B.  A  DECISION  UNDER  RISK 

Two  elements  in  the  structure  of  a  decision  are  the  set  of  alternatives  available  to 
the  decision  maker  and  a  set  of  factors  (over  which  he  has  no  control)  influencing  the 
decision  outcome.  These  latter  elements  are  called  futures  or  future  states  of  nature.  .A 
decision  under  ceriainiy  is  one  in  which  we  assume  that  one  and  only  one  future  state 
will  occur.  A  decision  under  uncertainty  is  one  in  which  we  are  unable  to  estimate  the 
probabilities  of  these  events.  A  decision  under  risk  is  one  in  which  we  can  estimate  the 
probabilities  of  these  future  states  and  more  than  one  of  them  is  greater  than  zero. 
[Ref  2J  The  relative  payoffs  for  different  outcomes  in  this  2x2  decision  under  risk 
matrix  are  not  the  same  for  the  United  States  and  the  Soviet  Union.  Therefore,  this 
situation  cannot  be  viewed  as  a  two-person  zero-sum  game.  However,  the  conflict  of 
interest  situation  could  be  viewed  as  a  statistical  decision  problem  if  the  frequency  of 
employment  of  peaceful  (or  aggressive)  actions  by  the  Soviet  Union  was  regarded  as  a 
random  state  of  nature.  If  the  proportion  of  actions  that  are  peaceful  by  the  U.S.S.R. 
could  be  described  probabilistically  and  the  advisors  in  the  State  Department  and  the 
Joint  Chiefs  of  Staff  could  estimate  that  distribution,  then  a  good  or  reasonable 
estimate  could  provide  the  President  with  a  statistical  foundation  for  making  a 
decision.  The  decision  maker,  in  order  to  reduce  the  risks  of  selecting  alternatives, 
would  like  to  know  what  proportion  of  actions  by  the  Soviet  Union  are  peaceful. 

The  decision  chosen  for  a  course  of  action  by  the  United  States  could  be 
deternuned  using  a  decision  rule  that  depended  in  part  the  probability  distribution  that 
was  used  to  describe  the  Soviet  Union's  (or  any  antagonist's)  past  behavior.  This 
proba'oility  distribution  might  initially  be  a  subjective  one,  with  which  the  United 
States  (or  any  protagonist)  conjectured  as  to  best  describing  the  antagonist's  behavior. 
This  probability  distribution  is  known  as  an  a  prion  probability  distribution. 

C.  A  PRIORI  TO  A  POSTERIORI  PROBABILITY  DISTRIBUTION 

.After  the  establishment  of  the  a  priori  probability  distribution,  the  advisors  could 
observe  tlie  conduct  of  the  antagonist  and  interpret  specific  events  as  associated  with 
one  of  the  two  possible  courses  of  action.  These  observations  may  then  be  used  to 
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update  the  previously  conjectured  probability  distribution.  Bayes  Theorem  provides  a 
method  for  updating  the  a  priori  probability  distribution.  In  general  terms,  Bayes 
Theorem  uses  an  a  priori  probability  function  and  derives  an  a  posteriori  probability 
function  given  the  sample  values  of  a  random  variable.  After  the  sample  values  are 
known,  the  a  posteriori  probability  function  summarizes  the  a  priori  and  sample 
information  about  the  proportion  of  actions,  by  the  antagonist,  that  are  peaceful. 
Bayes  Theorem,  thus,  improves  the  assessment  of  the  uncertainty  as  to  which 
conjectured  course  of  action  the  antagonist  is  pursuing.  The  changed  probabilities  are 
known  as  a  posteriori  probabilities.  These  become  the  present  estimates,  by  the 
protagonist,  of  the  probabilities  that  the  antagonist  is  using  one  or  the  other  specific 
course  of  action.  These  a  posteriori  probabilities  then  become  the  a  priori  probabilities 
in  further  estimates  of  the  antagonist's  behavior. 

D.  DECISION  RULES  FOR  THE  PROTAGONIST 

Once  an  a  posteriori  probability  distribution  has  been  established,  statistical 
decision  theory-  may  then  be  used  to  choose  objectively  an  appropriate  alternative 
through  the  use  of  a  decision  rule  that  is  based  on  the  matrix  payolfs  and  greatest 
expected  value  of  return.  This  decision  rule  will  assist  the  decision  maker  as  to  which 
strategy  to  pursue  given  any  past  history  of  the  antagonist's  behavior. 

The  particular  ordering  of  outcomes  in  terms  of  preferences  selected  and  the 
values  that  the  protagonist  assigns  to  these  outcomes  is  an  important  step  in  the  use  of 
this  model.  .As  wiil  be  seen  later  in  the  general  development  of  the  decision  rule,  this 
process  can  be  the  sole  determining  factor  by  which  the  model  proposed  here  will 
indicate  an  aggressive  or  a  peaceful  action  be  taken  by  the  protagonist. 

The  model  in  this  thesis  asks  a  decision  maker  to  choose  an  ordering,  from  the 
most  desirable  to  the  least  desirable,  of  four  possible  outcomes  which  show  how  each 
course  of  action  for  the  protagonist  (the  matrix  rows)  combines  with  a  course  of  action 
for  the  antagonist  (the  matrix  columns)  into  four  possible  outcomes.  Then  it  asks  him 
to  quantify,  on  an  interval  scale,  the  desirability  of  two  of  these  outcomes  relative  to 
f.vo  standards.  Next,  a  judgement  is  required  stating  which  of  many  conjectured  a 
priori  probability  distributions  he  believes  is  the  best  description  of  the  antagonist's 
behavior.  Based  on  the  a  priori  distribution  and  observation  of  actual  antagonist 
behavior  an  updated  a  posteriori  probability  distribution  is  established.  Given  the  a 
posteriori  probability  distribution,  which  wiil  not  be  the  same  for  different  occurrences 
of  antagonist  behavior,  a  decision  rule  is  derived  based  on  greatest  expected  value  of 
return. 


E.  THESIS  OUTLINE 

Theor\'  and  general  model  development  will  occur  over  the  next  five  chapters. 
Chapter  II  will  discuss  the  specific  categorizations  of  actions.  It  will  show  how  these 
actions  are  combined  into  outcomes  (or  states)  in  what  will  be  called  a  conllict 
situation  matrix.  Chapter  II  will  also  include  an  explanation  of  the  ranking  and 
quantification  of  these  outcomes.  Chapter  III  will  discuss  the  optimal  solutions  to  the 
decision  problem  represented  by  the  conflict  situation  matrix.  Chapter  IV  will  show 
how  to  estimate  the  antagonist's  behavior  using  a  conjectured  a  priori  probability 
distribution  and  Bayes  Theorem.  Chapters  V  and  VT  will  show  the  difference  between 
using  a  discrete  or  a  continuous  a  priori  probability  distribution  and  the  decision  rules 
that  result.  Chapter  VII  contains  specific  examples.  Chapter  VTII  contains  concluding 
remarks.  We  turn  now  to  a  discussion  of  the  general  two  alternative,  two  states  of 
nature  conflict  situation  matrix  that  is  the  basis  for  the  model  in  this  thesis. 


II.  CONFLICT  SITUATION  MATRIX 


This  chapter  begins  with  a  definition  of  the  two  alternative  courses  of  action 
available  to  the  protagonist  and  the  antagonist.  We  will  see  how  these  strategies  result 
in  four  possible  outcomes  or  states,  in  a  conflict  situation  matrLK.  to  which  the  decision 
maker  assigns  utility  values.  This  chapter  will  conclude  with  a  discussion  of  how  these 
assigned  utility  values  relate  to  each  other  on  an  interval  scale. 

A.  ALTERNATIVE  COURSES  OF  ACTION 

Imagine  a  situation  involving  a  conflict  of  interests  between  two  nations 
designated  as  the  protagonist  (P)  and  the  antagonist  fA).  The  protagonist  will  be  the 
nation  making  the  decision  in  the  face  of  uncertainty.  The  protagonist  has  a  set  of  two 
possible  courses  of  action  that  it  can  pursue.  The  set  of  possible  courses  of  action  for 
the  antagonist  also  consists  of  two  different  actions.  These  courses  of  action  are 
denoted  as  or  P^  for  the  protagonist,  and  or  Aj^  for  the  antagonist.  Let  the 
courses  of  action  for  the  nations  involved  have  the  following  meaning: 

P^  -  cold  war.  non-aggressive  or  peaceful  action, 

P[^  -  hot  war,  aggressive  or  violent  action, 

.A^  -  cold  war.  non-aggressive  or  peaceful  action, 

.Aj^  -  hot  war,  aggressive  or  violent  action. 

The  four  possible  situations  which  can  then  occur  as  the  nations  involved  exercise  their 
options  may  be  denoted  as; 

(P,,.A,).  (P,.Ah).  (Ph.A,)  and  (Ph-Ah). 

The  pair  fP^..A^)  describes  the  condition  of  both  nations  pursuing  peaceful  actions 
towards  each  other.  The  pair  (P^,..A|,^)  denotes  the  protagonist  displaying  either  peaceful 
or  nor.-vioient  actions  towards  the  aggressor  while  the  antagonist  is  displaying 
aggressive  or  hoi-^var  like  actions.  The  pair  iP[^..A^)  is  the  reverse  of  the  previous 
situation  and  (Pj^..Aj^)  is  both  nations  acting  hot-war  like  or  displaying  aggressive 
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actions  towards  each  other.  These  two  alternative  courses  of  action  for  both  the 
protagonist  and  the  antagonist  could  be  represented  as  a  2  x  2  matrix.  The  two 
columns  are  the  two  possible  behaviors  the  antagonist  can  exhibit,  and  the  two  rows 
would  be  the  two  alternative  courses  of  action  available  to  the  protagonist.  The  four 
ceils  of  this  matrix  are  the  four  outcomes  that  can  occur.  A  generalized  representation 
of  this  conflict  situation  matrix  is  shown  in  Figure  2.1. 


Pr  +1 


b  -1 


Ficure  2.1  Conflict  Situation  Matrix. 


B.  RANKING  OF  OLTCOMES 

■A  decision  maker  must  select  an  alternative  act,  course  of  action  or  strategy  from 
a  recognized  set  of  decision  alternatives.  The  connection  between  decisions  and 
rankings  of  outco.mes  (or  preferences)  in  any  decision  scenario  is  made  by  assuming 
that  preferences  are  the  driving  force  behind  decision  choices.  That  is.  it  is  assumed  a 
decision  maker  would  rather  im.piement  a  course  of  action  that  led  to  a  more  preferred 
outcome  over  one  that  led  to  a  less  preferred  outcome.  (Ref.  3) 


Suppose  for  a  moment  that  the  preferences  chosen,  by  the  protagonist,  for  the 
four  possible  outcomes  are  represented  as 

V(P,,A,'  >  V(P,,.A,)  >  V(Ph.A^)  >  V(P,.Ah), 

with  the  meaning  that  the  situation  to  the  left  of  the  symbol  "  >  "  is  preferred,  by  the 
protagonist,  to  the  situation  to  the  right  of  the  symbol,  and  the  notation,  V(  ),  is  the 
value  of  that  outcome.  These  and  all  orderings  of  outcomes,  by  the  protagonist,  shall 
be  subject  to  the  general  requirements  of  utility  theory^  in  that  they  are  transitive.  That 
IS.  if  .A  >  B  and  B  >  C  then  A  >  C  or  if  the  protagonist  is  indifferent  between  A  and 
B  and  is  indifferent  between  B  and  C  then  he  is  indifferent  between  A  and  C.  This 
establishes  the  notion  of  coherence  in  decision  theory  or  transitivity  in  the  laws  of 
mathematics.  [Ref  4:  p.  74] 

Notice  that  rankings,  other  than  the  one  suggested  above,  could  occur.  There 
may  be  situations,  in  the  eyes  of  the  decision  maker,  where  being  at  war  while  his 
antagonist  is  not.  (P^.A^),  is  preferred  to  both  being  at  peace,  (Pj,,A^),  so  that 
V(P^,A^)  is  greater  than  V(P^,A^).  .Another  possibility  may  arise  in  the  case  of  a 
political  decision  maker  looking  for  justification  of  aggressive  maneuvers  against  a  less 
capable  antagonist.  He  might  prefer  the  antagonist  to  take  aggressive  actions  towards 
him  first.  (P^..A[^).  to  all  other  outcomes,  thus  justifying  a  preference  of  (Pj^..Aj^).  to  all 
other  outcomes  later  on. 

C.  UTILITY  VALUES  FOR  THE  OUTCOMES 

Once  an  ordering  of  possible  outcomes  or  preferences  has  been  selected,  a 
quantification  of  those  outcomes  needs  to  be  determined  by  the  protagonist  in  order  to 
work  Within  the  framework  of  this  model.  As  described  in  Fundamentals  cf  Decision 
Analysis  by  Irving  H.  LaValle,  [Ref  4i  p.  72]  utility  values  assigned  reflect  the  relative 
desirabilities  of  the  possible  outcomes.  .A  generalized  utility  value  assignment  of  the 
outcomes  in  this  model  might  be  indicated  as  follows: 

V(P^.A^)  =  ^  1.0 
VrP,^.A[^)  =  -  1.0 
V(P^...Ah)=  a 
V(P;,.A^.)  =  b. 
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Note  that  this  utility  scale  uses  two  specific  outcomes  as  references,  whereas  the  usual 
procedure  assigns  the  value  of  zero  to  the  least  preferred  outcome  and  a  value  of  one 
to  the  most  preferred  outcome.  The  two  representations  are  merely  linear 
transform.ations  of  each  other  and  are  equivalent  in  information  content.  [Ref  5]  The 
meaning  of  the  value  of  -  1.0  attached  to  is  a  standard  of  preference  for  a 

desirable  situation  -  both  nations  at  peace.  The  value  of  -  1.0  attached  to  V(P^.Aj^)  is 
a  standard  of  preference  for  an  undesirable  situation  -  both  nations  at  war.  In  the 
example  considered  in  the  previous  section  a  has  a  value  less  than  -  1.0  and  b  has  a 
value  less  than  +  1.0  and  greater  than  -  1.0.  .Vlore  succinctly,  for  this  example  of 
preferences: 

V(P^.A,)  >  V(P^.A,)  >  V(P^.A|,)  >  V(P^.Ah), 
or 

1.0  >  b  >  -  1.0  >  a. 

The  assignment  of  reference  values  to  the  outcomes  (P^-.A^,)  and  (Pj^.Aj^)  of  1.0  and 
-  1.0  forces  the  V(P^,Ag)  to  ahvays  be  greater  than  V(Pj^.Aj^)  in  this  model.  This 
modeling  restriction  allows  12  possible  orderings  of  outcomes.  Which  ordering  of 
outcomes  the  protagonist  selects  will  ultimately  affect  the  alternative  course  of  action 
he  should  pursue  to  gain  his  greatest  expected  value  of  return.  The  utility  values  a  and 
b  can  be  any  arbitrary  real  numbers  for  the  situations  (P^^.Aj^)  and  (Pj^..-\^,).  Each 
numerical  value  is  the  i'liliiy  to  the  protagonist  of  each  possible  situation.  .-\ 
generalized  matrix  representation  of  the  conflict  situation  with  associated  utility  values 
assigned  by  the  protagonist  would  appear  as  previously  illustrated  in  Figure  2.1. 

D.  INTERVAL  SCALING 

The  utility  values  of  the  conflict  situation  matrix  are  on  an  interval  scale.  Interval 
scales  have  no  natural  origin.  There  is  a  zero  point  on  this  scale  but  it  is  a  iiej'.ntiJ 
point,  while  a  natural  origin  would  indicate  the  absence  of  the  property  being 
measured.  [Ref  6]  IfV-  is  the  utilitv  value  ofi^*^  outcome,  then  transformations  of  the 
type  h'  =  m(V|)  -  c  (linear  with  a  positive  slope)  are  allowed.  Due  to  the  lack  of  a 
natural  origin  on  the  interval  scale,  one  \alue  of  V-  cannot  be  said  to  be  a  proportion 
of  another  value  on  that  scale.  Wiicn  using  interval  scales,  one  may  only  speak  of 
relative  d':;J^rences.  For  example,  the  decision  maker  would,  after  quantiiication,  want 


to  be  able  to  say  that  he  believes  that  one  outcome,  relative  to  the  worst  outcome  is  p 
times  as  good  as  another  outcome  relative  to  the  worse.  In  other  words,  if  Vj  is  the 
value  for  outcomes  i  =  1,2,3  or  4  and  Vj  >  \\  >  \'^  >  V^.  It  could  then  be  stated 
that: 

-  ^'4 

P  =  - 

^'2  -  ^4 

This  would  imply  that  the  difference  between  outcomes  Vj  and  is  p  times  as  good 
as  the  difference  between  outcomes  V,  and  V^.  This  is  the  basic  information  contained 
in  an  interval  scale. 

In  this  chapter  we  have  introduced  the  conflict  situation  matrix  (Figure  2.1) 
showing  the  possible  courses  of  actions  available,  their  resulting  outcomes  and  the 
concept  of  utility  values  associated  with  these  outcomes.  The  utility  values  assigned  are 
rated  on  an  interval  scale  relative  to  two  standards  of  preference.  In  Chapter  III  we 
will  discuss  the  optimal  solutions  to  the  conflict  situation  matrix  using  dominance  and 
greatest  expected  value  of  return  when  the  decision  is  made  with  knowledge  about  the 
proportion  of  antagonist  actions  that  are  peaceful. 


III.  OPTIMAL  SOLUTIONS  WITH  CERTAINTY  ABOUT  THE 
PROPORTION  OF  ANTAGONIST  ACTIONS  TH.AT  ARE  PEACEFUL 


The  uncertamty  in  any  decision  making  process  may  usually  be  expressed  by 
saying  that  we  do  not  know  what  event  will  occur  on  some  future  occasion,  where 
events  are  defined  as  subsets  of  a  sample  space.  (Ref.  7] 
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Figure  3.1  Conflict  Situation  .Matrix  Under  Risk. 

In  this  model,  the  sample  space  i.*;  the  set  of  values  allowed,  for  the  proportion  of 
antagonist  actions  that  are  peaceful.  An  event  about  which  we  are  informed,  as  to 
whether  it  will  occur  or  not.  is  called  a  certain  event.  Otherwise,  it  is  referred  to  as  an 
uncertain  e\ent.  [Ref.  8:  p.  S) 
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In  this  chapter  we  look  first  at  the  case  where  the  decision  maker  is  certain  about 
the  proportion  of  actions  by  the  antagonist  that  are  peaceful.  In  particular  we  define  6 
as  the  proportion  of  actions  by  the  antagonist  that  are  peaceful.  Since  the  decision 
maker  is  certain  as  to  the  value  of  0,  this  implies  Pr(0  =  0)  =  1.0,  where  0  is  a 

realization  of  the  random  variable  0.  In  the  notation  used  here.  denotes  a  peaceful 
(coUi-war)  act  and  (its  complement)  denotes  an  aggressive  {hot-war)  act.  The 
proportion  of  actions  that  are  not  peaceful  is  (I  -  0).  since  Aj^  is  the  only  other  act  that 
can  occur.  The  conflict  situation  matri.x  under  risk  is  shown  in  Figure  3.1. 

A.  DOMINANCE 

From  the  decision  matrix  it  is  clear  that  if  a  is  greater  than  or  equal  to  -  1.0.  and 
b  is  less  than  or  equal  to  +  1.0,  then  the  decision  maker  will  And  P^,  optimal  by 
dominance  regardless  of  the  value  of  0.  Similarly,  when  b  is  greater  than  or  equal  to 
1.0,  and  a  is  less  than  or  equal  to  -  1.0.  Pj^  will  be  optimal,  regardless  of  the  value  of 
0.  In  each  case  he  will  be  indifferent  when  the  equalities  hold,  again,  regardless  of  the 
value  of  0.  Regions  when  the  values  of  a  and  b  lead  to  dominance  are  seen  in  the 
shaded  regions  in  Figure  3.2. 

Considering  that  the  conflict  situation  matrix  in  this  model  sets  the  utility  values 
of  outcomes  (P^,Aj^)  and  (P^,,.Ag)  equal  to  -  1.0  and  +  1.0  respectively,  we  find  that 
there  are  12  possible  ordering  of  preferences.  Five  of  these  orderings  are 


1)  V(P^.Ac)  >  V(Pj^.A^,)  >  V(P^.Ah)  ^  V(Pj^.Aj^) 
or 

+  1.0  >  b  >  a  >  -  1.0. 

2)  V(P^,Av,)  ^  V(P^.A^)  >  ViPj^.A^)  ^ 
or 

a  >  +  1.0  >  -  1.0  >  b, 

3)  VfP^.A^)  >  VtP^.Ah)  >  V(P^.A^,)  >  V(Pj^.Aj^) 
or 

^  1.0  >  a  >  b  >  -  1.0. 

4)  VfP^.A,)  >  ViP^.A;,.  >  V,Pi^.At,)  >  V,Pj^.A,) 
or 

-  1.0  >  a  >  -  I.O  >  1). 


Figure  3.2  Graph  of  Decision  Rule  Equation  with  Dominance  Regions. 


V(P^.Ahl  >  VVP^.A,,  >  V.P|,.,A,I  >  V,P|,,.A,,I 

or 

a  >  -^  1.0  ^  b  ^  -  1.0. 

ThCiC  r;\c  preference  patterns  all  have 

b  ^  ^  1.0  and  a  ^  -  1.0. 

They  all  wu!  find  aliernative  optimal  by  dominance. 

Only  one  of  the  remaining  seven  preference  selections  also  exhibits  the 
characteristic  of  dominance.  This  particular  ordering  of  outcomes  is  dominated  by  an 
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aggressive  response,  alternative  for  the  protagonist.  This  preference  pattern  is 


6)  V(Pj^,Ac)  >  V(P^.Ac)  >  V(Ph.Ah)  ^  V(P^.Ah) 
or 

b  ^  +  1  >  -  1  >  a, 

and  always  has 

b  ^  +  1.0  and  a  S  -  1.0. 

This  preference  choice  is  where  the  decision  maker  would  prefer  1)  acting  hot-war  like 
while  his  antagonist  was  not,  to  2)  peaceful  conditions  which  would  be  preferred,  to  3) 
total  war  which  would  be  preferred,  to  4)  being  peaceful  while  the  antagonist  acted 
aggressively.  In  this  case  the  decision  maker  would  always  find  P]^  optimal  by 
dominance. 

How  is  the  protagonist  to  decide,  when  there  is  no  dominance,  what  course  of 
action  is  most  advantageous  to  himself,  given  that  the  value  of  0  is  a  certainty'^  One 
basis  of  choice  he  might  use  is  to  select  the  alternative  that  gives  him  the  greatest 
expected  value  of  return. 

B.  EXPECTED  VALUE  AS  A  BASIS  FOR  CHOICE 

The  expected  value  from  the  conflict  situation  matrix  for  alternatives  P^.  is 

0  +  a(  I  -  9) 

and  for  alternative  Pj^,  is 

b0  -  (1  -  0). 

The  decision  maker  would  need  only  select  the  alternative  with  the  highest  expected 
value,  that  is,  whichever  of  the  two  quantities  above  is  the  maximum.  Since  the  values 
from  the  conflict  situation  arc  utility  values,  the  protagonist  would  be  selecting  the 
alternative  with  the  highest  expected  utility, 

.-\  decision  rule,  for  the  options  presented  in  this  matrix,  can  be  defined  as  the 
assignment  of  a  particular  course  of  action  for  any  value  of  0.  remembering  that  0  is 
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the  proportion  of  actions  by  the  antagonist  that  are  peaceful.  We  can  create  such  a 
decision  rule  by  setting  the  two  possible  decisions'  expected  value  quantities  equal  to 
each  other.  We  find,  after  simplifying  and  rearranging,  that  we  have  the  following 
linear  relationship  between  a  and  b. 


a  =  b(0  1  -  0)  -  (1,  1  -  0). 


(eqn  3.1) 


where  (0  1  -  0)  is  the  slope  of  the  line  and  (1  1  -  0)  is  its  intercept.  This  relationship  is 
a  family  of  lines  that  passes  through  the  point  a  =  -  1.0  and  b  =  -i-  1.0  for  all  values 
of  0,  0  <  0  <  1.0.  Equation  3.1  is  plotted  in  Figure  3.2  for  three  values  of0. 

C.  MAXIMIZING  EXPECTED  UTILITY  IN  THE  NON-DOMINATED  CASES 

We  now  turn  to  a  discussion  of  the  remaining  six  possible  orderings  that  do  not 
exhibit  the  characteristic  of  dominance.  This  is  where  the  decision  maker  always  rates 

a  >  -  1.0  and  b  >  +  1.0 
or 

a  <  -  1.0  and  b  <  +  1.0. 

Two  examples  of  these  preference  patterns  are 

V(Pj^.A,)  >  >  V(P^.Ah)  >  V(Pj^.Ah) 

or 

b  >  -  1  >  a  >  -  1, 


8)  V(P^.A^)  >  V(Pj^.A^)  >  V(Pj^,A^)  >  V(P^.Ah) 


-  1  >  b 


The  decision  maker  will  find  in  the  non-dominated  cases  that  expected  utility  will 
be  maximized  by  choosing  if 

a  >  b  (0  •  1  -  0)  -  (  1  1-0),  (eqn  3.2) 

or  by  choosing  if 

a  <  b  (0  ■  1  -  0)  -  (  1  1  -  0).  (eqn  3.3) 

He  will  be  indifferent  when  the  equality  holds  (Equation  3.1).  Equations  3.2  and  3.3 
gives  the  protagonist  a  well  defined,  unambiguous,  decision  rule  that  determines  what 
action  he  should  take,  given  a  known  value  of0.  In  Chapter  V  we  will  look  at  how  this 
type  of  decision  rule  can  be  used  when  the  value  of  0  must  be  estimated.  However, 
first  we  look  at  how  to  estimate  the  value  of  0  using  conjectured  a  priori  probability 
distributions  and  a  likelihood  function  in  Bayes  Theorem  to  provide  reasonable 
estimates  of  0. 


IV.  ESTIMATING  THE  UNCERTAINTY  ABOUT  THE  PROPORTION  OF 
ANTAGONIST  ACTIONS  THAT  ARE  PEACEFUL. 


L'kimately.  the  protagonist  must  choose  an  action  based  on  his  estimate  as  to 
which  course  of  action  the  antagonist  is  pursuing.  In  the  last  chapter  we  discussed  the 
case  where  the  protagonist’s  choice  would  be  that  course  of  action  which  gives  him  the 
greatest  expected  value  of  return.  We  also  described  the  case  where  the  proportion  of 
actions  by  the  antagonist  that  were  peaceful  was  known,  or  that  the  value  of  Q  was  a 
cenaimy.  What  if  protagonist  has  no  information  or  at  best  limited  information  on  the 
course  of  action  the  antagonist  is  pursuing?  In  this  chapter  we  will  see  how  to  estimate 
the  antagonist’s  future  behavior  using  Bayes  Theorem. 

A.  ANTAGONIST  ACTIONS  AS  BERNOULLI  TRIALS 

One  way  for  the  protagonist  to  gather  information  in  order  to  estimate  6  might 
be  to  observe  the  antagonist’s  behavior  and  from  these  observations  estimate  what 
course  of  action  he  is  pursuing.  The  two  types  of  actions.  (.A^.  or  .A^),  will  be  viewed 
as  two  mutually  e.xclusive  and  e.xhaustive  events,  in  the  sense  that  every  action  (or 
behavior)  observed  can  be  categorized  as  peaceful  or  aggressive.  [Ref  9)  .A  random 
variable  =  i  could  then  be  defined,  where: 

I  0  if  an  aggressive  behavior  is  observed 


I.O  if  a  peaceful  behavior  is  observed 


and  n  is  the  n^^  behavior  observed.  These  observations  of  tne  behaviors  can  be  viewed 
as  the  occurrence  of  a  Bernoulli  random  variable  with  parameter  0.  We  wouid  then 
have; 


PriXj^  =  1)  =  .0)'  (1-0) 


1-1 


( eqn  4. 1 1 


Our  problem  considers  the  case  where  there  is  also  uncertainty  as  to  the  value  of 
the  parameter  0.  The  protagonist  doesn't  know  what  course  of  action  or  mixture  of 
courses  c!'  action  will  occur.  He  mav,  however,  be  willing  to  make  some  5ub;ecti\e 
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guesses  or  conjectures  as  to  what  values  of  6  best  describe  the  proportion  of  actions, 
by  the  antagonist,  that  are  peaceful.  This  would  make  our  Bernoulli  trial,  the 
Pr(Xj^  =  i),  conditional  on  that  proportion.  6.  Therefore,  we  define  the  probability 
that  the  antagonist  displays  either  peaceful  or  aggressive  behavior,  given  that  the 
proportion  of  actions  that  are  peaceful  (cold-war  like)  is  6,  as: 


PnXj^  =  liG)  =  {0)Ml  -0)^-'. 


(eqn  4.2) 


B.  A  PRIORI  PROBABILITY  DISTRIBUTIONS  ESTIMATE  UNCERTAINTY 

The  initial  estimates  about  the  proportion  of  antagonist's  actions  that  are 
peaceful.  0,  maybe  subjective  on  the  part  of  the  protagonist  or  may  come  from  historic 
data.  In  the  discrete  case,  we  can  generalize  these  initial  estimates  as: 

Pr  (0  =  0^). 

For  example,  the  protagonist  might  conjecture  that  the  antagonist  acts  peacefully  only 
a  small  proportion  of  the  time  and  might  represent  his  uncenainy  about  the  value  of  0 


Pr  (  O  =  .99  )  =  .05 


Pr  (  O  =  .10)  =  .95. 

The  probability  distribution  function  of  O  has  the  properties,  that  the 

Pr(0  =  0J.)  >  0  and  Pr(0  =  0j.)  =  1.0 

for  r  =  1.2,3,....  where  0^.  are  the  possible  values  for  the  parameter.  Both  the  discrete 
and  the  continuous  cases  will  be  considered  in  this  thesis.  In  the  continuous  case,  the  a 
prinri  probability  distribution  will  be  denoted  as  f  (0). 


For  the  discrete  case  only  certain  values  for  0^.  in  the  interval  (0,1)  are  allowed. 
The  interval  (0.1)  is  intentionally  displayed  as  an  open  interval  because  attaching 
probabilities  of  0  or  1.0  to  uncertain  events  can  lead  to  intractable  conclusions 
suggested  by  Cromwell's  rule.  This  will  be  discussed  in  detail  in  the  next  section  on 
Bayes  Theorem. 

These  probability  functions  represent  the  protagonist's  a  prion  estimates  of  0. 
Before  or  without  the  introduction  of  more  information,  these  a  prion  estimates  of  the 
probability  distributions  are  the  only  estimates  the  protagonist  has  available  in  this 
model.  However,  after  the  protagonist  observes  the  antagonist's  behavior,  defined 
above  as  Xj^=i  where  i  =  0  or  1.0,  a  resulting  a  posieriori  probability  distribution 
conditioned  on  the  value  of  0  is  given  by  Bayes  Theorem. 

C.  BAYES  THEORE.M 

Ultimately  we  want  to  show  that  the  conditional  probability  distribution  being 
considered,  the  probability  that  the  proportion  of  actions  that  are  peaceful,  or 
Pr(  0  =  0  ),  given  his  most  recently  observed  behavior  is  =  i.  where 

=  0  -  occurrence  of  an  aggressive  behavior  {hot-war) 

Xj^  =  I  -  occurrence  of  a  peaceful  behavior  {cold-war), 

is  given  in  this  thesis  as: 

Pr(0  =  0^)  Pr(Xj^  =  i!  0^) 

Pr(O  =  0^iXj^  =  i)  =  - ,  r=  1.2.3,...  (eqn4.3) 

Yj  Pr(0  =  0^)  Pr(Xj^  =  ij  03) 

[Ref  10]  We  begin  with  the  definition  of  conditional  probability,  which  is  given  as: 

Pr(0  =  0^  n  Xj,  =  1) 

P(0  =  0r  1  Xj,  =  i)  =  - . 

RXn  =  1) 

In  order  to  obtain  the  unconditional  distribution  of  Xj^  =  i,  (the  denominator)  we 
invoke  a  widely  used  theor\'  in  the  study  of  stochastic  processes,  that  is,  the  Theorem 


of  Total  Probability.  [Ref.  11]  which  states,  that  if  Pr(Xj^  =  i  !  0^.)  for  r  =  1.2.3.,..  is 
defined,  then  we  have: 


Pr  (Xn  =  i)  =  Xr  =  ®r‘  <‘^n  ==  ' '  ®r>- 

The  conditional  probability,  or  in  Bayesian  terms  the  likelihood  of  occurrence  X^  =  i. 
was  defined  in  the  section  on  Bernoulli  trials  (Equation  4.2)  as 


Pr  (X^  =  i  I  0^)  =  10r)‘  (1  -  .  (eqn  4.2) 


We  can  also  shown  from  the  definition  of  conditional  probability  that: 


Pr  (0  =  0^  n  X^  =  i)  =  Pr  (0  =  0r)  Pr  =  i  |  0^). 


Substituting  these  into  our  defnition  of  conditional  probability,  we  have  our 
original  e.xpression: 


Pr  (0  =  0^)  Pr(X„  =  i|  0^) 

Pn  0  =  0j.,Xj^=  i)  =  - ,  r=  1,2,3....  (eqn  4.3) 

Pr(0  =  0J.)  Pr(Xj,  =  il  0^) 

.After  substituting  in  our  likelihood  function.  Equation  4.2  we  have  the  general  form  of 
the  Bayesian  conditional  a  pos'eriori  probability  function  used  throughout  this  thesis: 

Pr(0  =  0/1  (0^)'  (1  - 

Pn0  =  0j.|Xj^  =  i)  =  - .  r=  I.2.3....  (eqn  4.4) 

Xr  PnO  =  (0/‘  (1  -  0/*-‘ 

Thi?.  equation  and  its  continuous  cu'^c  analogue,  update  the  a  priori  probability 
function  into  an  a  posteriori  probabiiitc  I'uncticn  that  is  conditioned  on  the  proportion 


of  actions  that  are  peaceful  by  the  antagonist.  Equation  4.4  is  the  conditional 
probability  that  the  antagonist's  proportion  of  peaceful  actions  is  0^.  in  the  present 
conllict  situation  given  that  he  has  just  recently  been  observed  acting  aggressively  (for 
=  0).  This  is  the  result  of  the  a  priori  probability  of  the  proportion  of  peaceful 
actions  times  the  conditional  probability  that  the  antagonist  acted  aggressively  m  the 
past,  given  that  his  proportion  of  peaceful  actions  is  0j..  divided  by  the  sum  of  all  the 
ways  these  events  can  occur. 

The  random  variable  will  always  be  considered  as  a  discrete  random  variable, 
but  the  a  priori  probability  distributions  may  be  discrete.  Pr  (0  =  0^.).  or  continuous, 
f  (0).  For  the  case  where  it  is  continuous.  Bayes  Theorem  is; 


f(0 ;  =  1)  = 


f{0)(0y(i  -0)1-1 


I  f(0)(0)‘(l  -  0)1-1  d0 


(eqn  4.5) 


The  role  of  Bayes  Theorem  is  to  establish  a  connection  between  the  protagonist's 
a  priori  probability  estimates,  an  expression  of  his  uncertainty  that  the  antagonist  will 
act  peacefully.  Pr(0  =  0).  and  the  effect  that  additional  information  or  evidence  uill 
be  to  revise  these  probabilities.  If  the  protagonist  were  to  obtain  perfect  information, 
one  of  the  probabilities  would  go  to  +  l.O  so  that  the  Pr  (0  =  0)  =  +  1.0.  This 
would  be  the  one  corresponding  to  the  true  distribution  of  0.  The  remaining 
probabilities  would  go  to  zero.  Partial  or  less  than  perfect  information,  like  that 
obtained  through  the  model  proposed  here,  produce  a  less  marked  effect  on  the  a 
posteriori  probabilities  through  the  use  of  Bayes  Theorem. 

.w  result  that  follows  from  Bayes  Theorem  is  that  it  is  inadvisable  to  attach 
probabilities  of  zero  to  uncertain  events.  Uncertain  events  of  zero  probability  remain 
so.  no  matter  what  additional  information  is  provided.  Therefore,  the  decision  maker 
must  be  aware  of  the  possibility  that  he  may  be  mistaken,  for  if  he  thinks  that 
something  cannot  be  true  (e.g.,  peaceful  intentions  on  the  part  of  his  antagonist)  and 
interprets  this  as  having  zero  probability,  his  decisions  uall  never  be  influenced  by  any 
contradicton.’  information  which  is  surely  absurd.  This  result  is  known  in  the  study  of 
Bayesian  techniques  as  Cromwell's  Rule.  [Ref  S:  p.  104]  This  same  elTcct  holds  true, 
when  a  probability  of  1.0  is  assigned,  since  the  assignm^ent  of  a  probability  of  1.0  to  an 
uncertain  event  implies  the  probability  of  its  complement  is  zero. 
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Assuming  probabilistic  independence  of  the  successive  observations,  the  a 
posteriori  probability  after  any  set  of  observations  becomes  the  a  prion  estimate  for  the 
next  obser\'ation{s).  It  can  also  be  shown  that  for  any  number  of  observations,  the 
conditional  likelihood  function  in  Saves  Theorem  is 


Pr(Xj^=i!  er)  =  (ej.)'I^nMl 


(eqn  4.6) 


where  n  is  the  total  number  of  behaviors  occurring,  and  is  the  summation  of  the 
random  variable  X^^  =  i  for  i  =  0  or  1.0.  [Ref.  10]  In  this  thesis  we  will  deal  with  the 
behaviors  one  at  a  time,  with  specific  examples  worked  out  in  the  Chapter  VTI. 

In  this  chapter  we  introduced  the  concept  of  the  antagonist's  exhibition  of 
peaceful  or  not  peaceful  behavior  as  the  realization  of  a  Bernoulli  random  variable 
conditioned  on  the  proportion  of  antagonist  actions  that  are  peaceful.  This  Bernoulli 
trial,  combined  with  an  expert  s  conjectured  a  priori  probability  distribution,  are  used 
in  Bayes  Theorem  to  update  this  probability  distribution  into  a  current  estimate  of  the 
antagonist  s  intentions.  In  the  next  chapters  we  will  take  these  a  posteriori  probability 
distributions  for  the  discrete  and  the  continuous  cases  and  see  how  they  can  be  used  to 
create  decision  rules  based  on  maximizing  expected  utility. 
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V.  DISCRETE  CASE  DECISION  RULES  WITH  BAYES  THEOREM 

In  this  chapter  we  will  consider  the  discrete  case  for  the  a  prior:  probability 
distribution  of  the  proportion  of  antagonist  actions  that  are  peaceful,  9.  We  will  begin 
by  showing  a  general  decision  rule  for  the  decision  problem  represented  by  the  conflict 
situation  matrix.  This  will  be  folloifed  by  a  discussion  of  how  to  use  critical  values  and 
most  iikely  values  as  a  means  of  selecting  6.  This  chapter  concludes  with  a  section  on 
using  maximiUm  expected  utility  values  considering  uncertain  values  of  9.  Chapter  \'I 
Will  consider  the  continuous  cases  as  the  a  priori  probability  distributions. 

.4.  A  GENERALIZED  DECISION  RULE 

The  decision  rule  from  Chapter  III  for  ma.ximizing  expected  utility  value,  with  9 
certain,  is  to  choose  if 


a  b(9  I  -  9)  -  f  1  1-9), 


(eqn  3.2) 


and  if 


a  <  b{9  1  -  9)  -  (  I  1  -  9). 


ieqn  3.3) 


Returning  to  our  original  definition  of  expected  value  for  the  two  alternatives.  (P^,  or 


P',^).  in  the  conflict  situation  m.atrix  we  define 


9^  *  all  -9/)  =  E{P^.9^), 


b9r  -  il  -  9^)  =  E  iPh.9^). 

Here.  E  (P^.9j.)  is  the  expected  utility  value  for  alternative  P^,  given  any  particular 
value  cf  9.  .Also,  the  E  (Pj^.9j.)  is  the  expected  utility  value  for  alternative  Pj^.  gi\en 
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any  particular  value  of  0.  We  can  now  generalize  our  decision  rule  to  choose  the 
alternative  such  that  it  is 


MaXj  E(Pj.e^) 


(eqn  5.1 


& 


where  j  =  c  or  h  for  any  value  ofS^.. 

B.  CRITICAL  AND  MOST  LIKELY  VaLL  ES 

The  question  that  still  remains,  given  that  the  decision  maker  has  selected  a 
preference  pattern  (and  utility  values  for  a  and  b),  is  what  value  to  use  for  6,:’  The 
protagonist  might  select  a  critical  value  that  any  estimates  ofGj.  should  equal  or  exceed, 
and  !l'  he  had  the  option,  not  consider  a  value  of  0^.  until  its  probability  of  occurrence 
equalled  or  exceeded  this  stated  critical  value.  Suppose  the  decision  maker  selected  .95 
as  his  critical  value,  so  that  he  would  not  select  an  alternative  until  any  particular 
probability  estimate  equalled  or  exceeded  this  value.'  .After  the  Pr(0  =  0j.)  equals  or 
exceeds  .95  he  could  substitute  that  value  for  0j.  into  Equations  3.2  and  3.3,  and  based 
on  whichever  equation  holds,  choose  the  corresponding  alternative.  This  changing  of 
probabilities  is  best  demonstrated  with  an  example. 

Suppose,  that  the  protagonist  feels  that  the  antagonist  acts  peacefully  a  small 
proportion  of  times  and  that  he  acts  aggressively  a  large  proportion  of  times,  and  that 
he  suggests  this  a  priori  probability  distribution,  such  that 

Pr  (O  =  9  10)  =  14 


Pr  (O  =  1  4)  =3  4. 

Now,  suppose  that  the  protagonist  observes  peaceful  behavior  on  the  part  of  the 
antagonist,  or  the  value  of  =  1.  Recalling  Bayes  Theorem  (Equation  4.4)  I'rom 
Chapter  IV: 


0rt'0r)' (1  - 

Pr(0  =  0^:Xj^  =  i)  =  - .  r=  1,2,3 . 

Pn0=  0^)(0^)Ml -0r)^'‘ 


(eqn  4.4) 


Mt  IS  conceded  here  that  third  alternative,  that  of  doing  nothing  is  not  included 
in  the  conflict  situation  mati-.v.  The  critical  \alue  discussion  is  predicated  on  the 
existence  ci'this  third  alternative. 
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and  using  the  situation  of  our  example  we  find 


Pr.  0=  9  10  :  1] 


(1  4) (9  10) 


(1  4) (9  10)  -  (3  4) ( 1  4) 


=  ( IS  33)  =  .55. 


(3  4)(1  4) 


Pri0=  1  4  Xn=  1> 


=  (15  33)  = 


(3  4) i 1  4)  ^  (1  4) (9  10) 


•As  can  be  seen,  none  of  the  values  of  the  a  posienon  probability  distributions  exceed 
our  decision  makers  stated  cyitical  -.alue  of  .95.  So  let  s  say  he  observes  a  second 
behavior,  and  this  time  he  observes  aggressive  behavior  by  the  antagonist,  or  =  <.). 
\3'e  again  return  to  Bayes  Theorem  to  update  our  probability  distribution,  remembering 
that  the  a  priori  distribution  now  was  the  a  posteriori  distribution  above.  We  now  have 


(1S33)(1  10) 


Pri0  =  9  10  I  Xj^=  0)  = 


=  .138. 


(1S  33)(1  10)  -(  15  33)  (3  4) 


(15  33) (3  4) 


Pn0=  1  4  ;  Xj^=  0)  = 


=  .862. 


(15  33) (3  4)  -(18  33) (1  10) 


The  protagonist  might  continue  taking  observations  of  the  antagonist's  behavior  until 
one  of  the  probabilities  does  actually  exceed  his  stated  critical  value.  He  could  then  use 
this  value  of  0j.  in  Equations  3.2  and  3.3  to  choose  his  alternative.  'What  if.  after  one 
or  more  observations,  the  protagonist  finds  that  none  of  the  proposed  probability 
disiributions  show  estimates  that  exceed  the  critical  value  (as  in  the  example  above i  and 
a  decision  must  be  made  now  as  to  which  course  of  action  to  pursuel*  This  is  the  case 
when  the  conditional  a  posteriori  probabilities  have  not  achieved  a  sujriaeni  pr'.hahiluy 
to  exceed  any  stated  critical  value.  The  protagonist  might  then  use  the  notsi  l.kcly 
value,  or  the  maximum  Pr(0  =  0^.  X^^  =  i.t  and  use  this  value  of  0^.  m  Equations  3.2 
and  3.3.  to  select  the  alternative  with  the  maximum  expected  utility.  Which,  ai'ter  ’he 


first  observation  would  mean  that  he  would  have  used  0  =  .90.  since  its  probability 
was  the  maximum,  whereas,  after  the  second  observation  he  would  have  used  the  value 
of  0  =  .25  as  its  probability  of  occurrence  was  the  maximum  in  that  instance.  The 
decision  maker  could  also  use  this  criterion  to  select  a  value  for  0j.  before  the 
observation  of  any  antagonist  actions.  In  that  case  he  would  have  selected  0  =  .25. 

C.  M.WIMIZE  EXPECTED  UTILITY  VALUE  CO.NSIDERING  ALL 

PROPOSED  PARAMETER  VALUES 

The  procedures  just  described  for  selecting  0  have  a  weakness  since  they  do  not 
use  all  the  information  available  in  the  problem.  If  the  protagonist  is  interested  in 
maximizing  his  expected  utility  value  relevant  to  all  of  his  esiimaies  of  an  antagonist's 
behavior,  then  he  would  not  want  to  lose  any  information  available  to  him.  This  would 
be  the  case  in  selecting  the  mos!  likely  and  critical  lalue  criteria.  Therefore,  he  would 
wan:  to  consider  all  possible  parameter  values  proposed  and  the  relative  frequency  with 
which  the>  can  occur,  and  use  the  expected  value  of  return.* 

In  the  discrete  case,  the  protagonist  computes  the  expected  value  of  return  over 
0J.  if  he  uses  course  of  action  P^.  and  he  computes  the  expected  value  of  return  over 
these  same  proposed  values  of  0j.  if  he  uses  course  of  action  Pj^.  The  course  of  action 
he  uses  corresponds  to  whichever  of  these  two  values  is  the  maximum.  This  maximum 


value  IS 


.Ma.Xj  =  ®r  ■  •'^n 


(eqn  5.2) 


for  j  =  c  or  h  and  r  =  1.2...  This  is  the  Bayesian  decision  rule  relative  to  the  a 
posteriori  probability  estimates.  [Ref.  121 

Returning  to  our  discrete  case  example  above,  we  have  after  two  observations 
and  assunung  that  the  protagonist  had  selected  the  eighth  listed  preference  pattern 
from.  Cl'.apter  III, 

V(P^.A^.>  >  V(Pj^..A..)  >  V(P|^.Aj^i  >  ViP^.Af^f 

.:r 

1  >  b  >  -  1  >  a. 


•'rite  term  expected  value  of  return  is  not  to  be  interpreted  as  the  value  that  will 
be  obtained  by  the  protagonist  or.  any  single  occurrence  but  rather  as  the  expected 
salue  Gl' return  due  to  the  uncertaintv  of  the  value  of  6. 


• 


u’lth  b  =  0  and  a  =  -  3.0.  In  accordance  with  5.2,  we  seek  the  maximum  of 

-0^)  PnO  =  0,  :  =  i  ). 

or 

P’'*®  =  ®r  !  ^n  =  ‘  ^ 

;br  both  values  cfOj..  Thus  we  have 

/-3.0K1  10M.13S)  (-3)(3  4)(.862)  =  -1.98, 

or 

(0  -  1.0)(.13S)  +  (0  -  1.0K.S62)  =  -1.0. 

The  protagonist  would  choose  alternative  Pj^. 

This  procedure  of  selecting  alternatives  P^,  or  Pj^  is  based  on  preference  pattern 
'election,  utility  value  assignment,  Bayes  Theorem  and  the  use  of  the  decision  rule,  5.2. 
This  decision  rule  can  be  used  for  any  number  of  proposed  values  for  the  parameter  0j. 
thus  allowing  the  use  of  all  the  information  available  in  the  problem.  In  the  next 
chapter  we  wUI  look  at  the  Bayesian  decision  rules  that  result  from  three  dilTerent 
continuous  case  a  priori  probability  distributions. 


VI.  CONTINLOLS  CASE  DECISION  RULES  WITH  BAYES  THEOREM 

Rather  than  just  considering  the  a  priori  probability  distribution  to  be  finite,  we 
would  like  to  ccn'^'.der  the  case  where  there  are  an  infinite  number  of  values  for  0^. 
possible.  In  this  case,  the  general  Bavesian  decision  rule  for  the  continuous  case  a 
priori  probability  function  is  to  choose  Pj  such  that 

Ma.\,  l'  E  ( P,.6)  f  (6  X_  =  1  )  d6  (eun  6. 1 ) 

j  j  j  II 

for  j  =  c  or  h.  There  are  three  particular  cases  of  continuous  probability  functions 
considered  in  this  chapter,  the  first  case  being  that  all  values  in  the  range  of  0  are 
equally  likely. 

.A  DECISION  RULES  USING  A  UNIFOR.M  A  PRIORI  DISTRIBUTION 

The  uniform  case  presents  a  good  starling  point  for  the  protagonist  if  he  has  no 
information  (or  no  idea)  as  to  what  proportion  of  actions,  by  the  antagonist,  are 
peaceful.  The  assumption  that  ail  possible  values  of  6  are  equally  likely  suggests  an 
unbiased  viewpoint,  and  when  combined  with  Bayes  Theorem,  allows  the  distribution 
to  weight  future  probabilities  on  observed  behaviors  alone.  This  can  be  seen  by  the 
form  of  the  a  posteriori  probability  distributions  plotted  in  Figure  6.1.  In  this  section 
we  assume  the  protagonist  uses  a  uniform  a  priori  probability  distribution,  then  we 
show  the  two  a  posteriori  probability  distributions  that  result  after  an  observation  of 
each  type  of  behavior.  From  these  two  updated  probability  distributions  a  generalized 
decision  rule,  based  on  the  conflict  situation  matrux.  is  derived. 

The  protagonist  assumes  that  all  values  in  the  range  of  0  are  equally  likely  and 
uses  a  uniform  distribution  as  his  a  prion  distribution.  The  general  form  of  the  uniform 
distrfoution  is 


ft0)  =  1  (d  -  c).  c<0<d. 

The  limits  for  0  are  0  and  1.0.  so.  the  general  form  becomes 

ft0'  =  1(1-  0). 
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The  initial  input  into  Bayes  Theorem  is  this  conjectured  a  priori  probability 
distribution.  The  protagonist,  next,  observes  the  anatagonist  s  behavior.  There  are  two 
po'^sibuitics  for  this  random  sanahlc;  =  0  or  -  1.0.  If  he  observes  aggressive 
behavior  then  =  •'  and  the  a  posierovi  distribution  becomes 

ftQ  =  0)  =  2  -  20  .  (eon  6  2) 

This  result  can  be  seen  by  beginning  with  the  general  form  of  Bayes  theorem  stated  in 
Chapter  IV  as 

fiGiieiM  -  0i‘-‘ 

f  {0  ,  Xj^  =  i)  =  - .  (eqn  d.oi 

Jq  f(0i  (0)‘  tl  -  0)*-'  d0 

where  for  the  uniform  case  with  i  =  0  becomes 

(i.O)n  -  0)1 

f  (0  X^  =  0)  =  - . 

J'q  i  l  .0)<  I  -  0|'  d0 

which  simplifies  to 

f  (0  Xj^  =  0)  =  2  -  20  .  (eqn  6.2) 

If  the  protagonist  observes  peace! ul  behavior  on  the  part  of  the  antagonist,  then 
tlie  random  variable.  .X  =  *  1.0,  and  tlie  a  posterior  distribution  becom.es 


This  result  can  be  seen  by  beginning  with  the  general  form  of  Bayes  theorem  which 
with  1  =  l.O  becomes 


f(e  1  =  1,0.)  = - 

j'0  ( l.O)(0)  d0 


which  reduces  to 


f(0  '  =  1.0)  =  20  .  (eqn  6.3) 

The  eiTect  that  Bases  Theorem  has  on  the  a  priori  distribution  can  be  seen  in  Figure 
6.1.  where  all  three  (the  a  priori  and  the  two  resulting  a  posteriori)  distributions  are 
plotted. 

Both  of  these  a  posteriori  probability  distributions  are  legitimate  probability 
distributions  since 

k  -«  -  "> 

and 

Je:.ie=  1.0. 

1  I  niform  Case  Decision  Rule  for  Xj^  =  0 

The  a  posteriori  distribution  that  results  after  X^^  =  0  is 
1^0  X^j  =  (1)  =  2-20  (Equation  6.3).  The  decision  rule  is  to  choose  that 
m.iMnuzes  Jq  E  '  Pj.0t  f  (0)  d0.  in  accordance  with  6.1.  and  we  have  the  integral  below 
to  esaiuate.  The  decision  is  to  choose  alternative  P^,  if  the  quantity  on  the  left  exceeds 
the  quantits  on  the  right: 

J0  ■0-a0  -  aM2  -  20 'u0  •  10  (0^  b0  -  1  )(2  -  20)  d0. 

•Alter  integration  this  decision  rule,  in  the  case  for  a  uniform  a  priori  and  the 
ib^er'.  .itien  ofX^  =  0.  beco.mes  scicwt  P,  if 


(eon  6.4) 


which  reduces  to 


:a  3  -  1 


The  inteara!  on  the  rieht  is 


^0  -  be  -  1  m2  -  26)  d0  . 


which  equals 


(-0  :be  -  2  -  20*  -  2b0-)  de 


which  reduces  to 


b  3  -  2 


L’se  of  these  two  values  front  our  integrals  brings  us  to  our  decision  rules  of  choosing 
if  a  >  b  2  -  3  2  or  if  a  <  b  2  -  3  2. 


2.  Lniform  Case  Decision  Rule  with  =  1.0 

The  u  posteriori  distribution  that  results  after  1.0  is 

f(0  =  1.0)  =  20  (Equation  6.2).  Following  from  the  decision  rule.  6.1.  we  have 

the  integrals  below  to  evaluate.  Notice  that  only  the  a  posteriori  density  function  has 
changed.  The  decision  maker  would  choose  P^,  if 

Jq  (6  -  ae  -  a)  (20)  d0  >  J9  (0  ^  bB  -  1)  (20)  d0  , 
which  reduces  to  choosing  P-  if 


a  >  2b  -  3. 


( eqn  6.6) 


or  P[^  if 


a  <  2b  -  3. 


leqn  6. 


Tins  result  can  be  seen  by  comparing  the  value  of  the  integral  on  the  left  to  the  value 
of  the  intesral  on  the  naht.  The  intearal  on  the  left  becomes 


2]’0  (0)-  -  af0)-  -  a(0)  d0 


which  equals 


a  j  2  j. 


I'he  intesral  on  the  risht  becomes 


2Jq  (0r  ^  b(0r  -  (0)  d0 


which  eouals 


2b  3  -  13. 


These  two  values  reduce  to  the  decision  rules  of  Equations  6.6  and  6.7  to  choose 


alternarii'c  if  a  >  25-3  cr  if  a  <  2b  -  3. 


B.  DECISION  RLLES  USING  A  BETA  (a  =  2,  p  =  2)  A  PRIORI 

DISTRIBUTION 

.Another  conceivable  distribution  that  the  decision  maker  might  consider  as  best 
describing  the  antagonist's  behavior  is  a  symmetric  dome-shaped  curve.  This  curve 
uould  have  the  mean  equal  to  the  mode,  and  be  centered  at  0  =  .50.  Such  a  curve 
would  mean  that  the  protagonist  feels  that  most  of  the  probability  for  0  is  at  the  value 
,50.  This  would  mean  he  feels  it  is  most  likely  that  half  the  actions  of  the  antagonist 
arc  .iggressive  and  half  of  them  are  peaceful,  sloping  olT  towards  zero  probability  at  the 
extreme  values  of  0.  This  kind  of  subjective  feel  for  the  a  priori  probability  distribution 
can  be  captured  by  a  Normal  distribution,  such  that  it  is  N  ~  (.5,  (T“).  However,  the 
range  ai' the  Normial  distribution  is  -  tc  to  -  ^  and  the  range  of  the  parameter,  0.  is 
o  to  -  1,0.  .A  Beta  distribution  with  parameters  a  =  P>  1  captures  the  essence  of  a 
-•vnumctric  dome-shaped  curve  centered  at  .50,  and  it  iinuts  itself  to  the  range  of  our 


’  •  •  ^  TJ*  *  •  *•  ^  fc  '*  *  *  J»  '  ■  "jl  ^  a  •  a^  •  *•  a'* 


m\ 


random  variable.  The  probability  density  function  of  a  Beta  distribution  has  the  form: 

r(a  +  P) 

f(0)  =  - . 

Hainp) 

For  example,  when  a  =  p  =  2.0, 

r(4i 

f(0)  =  -  (0)(l-0). 

n2)r(2i 


f(0)  =  6(0X1  -  0). 


When  this  conjectured  a  priori  probability  distribution  is  used  in  Bayes  theorem  we  get 


6(0)(1  -0)(0)‘(l  -0)*-* 


f(e  =  i)  = 


6Jq  (0)  (1  -  0)  (0)‘  (1  -  0)‘-*  d0 

and  when  the  observation  of  the  antagonist's  behavior  yields  =  0,  this  equals 

6'  0  H  1  -  0)-*' 

6^0  (0 )( 1  -  0)'  d0 

The  denominator  is  (6)(Beta(2,3))  density  function  which  equals  6(1  12).  The  entire 
expression  for  the  a  posteriori  probability  distribution,  with  a  conjectured  a  prion 
distribution  of  a  Beta  (2.2).  and  a  subsequent  observation  ofX^^  =  0.  reduces  to 


f(0  X^  =  0)  =  12(0)(1  -  0)-  . 


(eqn  6.S) 


If  the  subsequent  observation  of  antagonist's  behavior  is  peaceful,  or  =  1.0, 
then  our  formula  from  Baves  theorem  becomes 


f(0  I  =  1.0)  = 


6{e)2(i  -  0) 


6J0  (0)2(1  -0)d0 


where  the  denominator  is  (6)(Beta(3,2))  density  function  which  also  equals  6(1  12). 
and  we  have 


f(0  I  X^  =  1.0)  =  12(0r(l  -  0)  . 


(eqn  6.9) 


Figure  6.2  Beta(2.2)  Case  -  A  Priori  and  A  Posteriori  Distributions. 


The  edect  that  Bayes  Theorem  has  on  the  a  priori  distribution  can  be  seen  in  Figure 
6.2  where  all  three  (one  a  priori  and  two  a  posteriori)  distributions  for  the  Beta(2.2) 
case  are  plotted. 

1.  Beta  (2.2)  Case  Decision  Rule  with  =  0 

The  decision,  after  observation  of  =  0,  is  to  choose  alternative  if 


j0  .e-ae-a)i:(0Hi-er  de  >  j‘Q(e^be-ni2ie)(  i-0)-  d0  . 

and  the  decision  rule  reduces  to  selecting  if 

a  >  2b  3  -53.  teqn  6.10) 


or  Pv 


if 


a  <  2b  3  -  5  3  .  (eqn  6.11} 

This  result  can  be  seen  by  comparing  the  value  of  the  integral  on  the  left  to  the  one  on 
the  right,  where  the  integral  on  the  left  is 

12(  l-a)j0  (0)^(l-0)"  d0  ^  I2aj0  (0Hl-0)‘  d0  . 


and  the  two  terms  reduce  to 

12(l-a){r(3)r(3)  r(6)}  +  12a{r(2)r(3)  r(5)}. 


which  equals 


s  -  ■> 


5. 


The  integral  on  the  right  becomes 


12(l  +  b)f0  (0)-(l-0)‘  d0  -  12j0  (0)(l-0)*  d0, 


i2(i  +  b)iT(3)r(3)  r(6)}  -  i2tr(2)r(3)  r(5)}. 


which  equals 


2b  5  -  3  5. 

and  the  two  values  reduce  to  the  decision  rules  of  Equations  6.10  and  6.11  to  choose 
P^ifa  >  :h3-3  5orPi,.ra  <  2b  3  .  3  3. 

2.  Beta  (2,2)  Case  Decision  Rule  with  Xj^  =  1.0 

The  decision  after  observation  of  =  1.0  is  to  choose  alternative  if 

Iq  (0{l-a)-a)12(e)“(l-e)  d0  >  J0  (0(  1  +  b)  -  1  )12(0)‘(  1-0)  d0  . 

and  the  decision  rule  reduces  to  selecting  if 

a  >  3b  2  -  5  2  ,  (eqn  6. 12 1 


or  Ph  if 


a  <  3b  2  -  5  2  .  (eqn  6. 1 3 ) 

This  result  can  be  seen  by  comparing  the  value  of  the  integral  on  the  left  to  the  one  on 
the  right,  where  the  integral  on  the  left  becomes 

12(l-a)j0  (0)^(l-0)  d0  +  12aj0  (0)-(l-0)  d0. 


which  reduces  to 


12fl-a)[n4ir(2)  ri6)l  -  12a(r(3)r(2)  n5'!. 


which  equals 


2a  5  3  5. 


The  integral  on  the  right  becomes 

12(1  -hiJq  lei-d-G)  ae  -  12J0  (erd-Gi  dG. 


giving 


12(1  ^b'in4)r(2)  r(6)|  -  i2ir(3)r(2)  nsii. 


which  equals 


3b  5  -  2  5. 

These  two  salues  reduce  to  the  decision  rules,  for  the  Beta(2.2)  case  and  Xj,  =  I,  of 
Equations  6.12  and  6.13  to  choose  ifa  >  3b  2  -  5  2  or  Pj^  if  a  <  3b  2  •  5  2. 


C.  DECISION  RULES  USING  A  BETA  (a  =  1/2,  p  =  1/2)  A  PRIORI 

DISTRIBUTION 

•So  far.  .n  the  continuous  cases,  an  equally  likely  a  priori  probability  distribution 
'  unil'crm)  and  a  symmetric  dome-shaped  a  priori  probability  distnbution  centered  at  G 
==  .511  t  Beta  2.2)  have  been  considered.  What  if  the  protagonist  feels  that  the 
antagonist  is  an  extrenust  type  of  thinker,  in  the  sense  that  he  has  just  as  high  a 
probability  of  acting  peacefully  as  he  dees  of  acting  aggressively  and  a  low  probability 
of  an  even  spilt  between  the  two'’  This  would  be  the  case  if  most  of  the  probability 
were  v.  eighted  at  the  values  of  G  =  0  and  G  =  -^  1.0  with  a  very  low  probability  of  G 
=  The  essence  of  this  type  of  thinking  could  be  captured  with  another  Beta 
ai'itribution  with  a  =  P  <  1.  This  is  a  U-shaped  probability  density  function  centered 
acain  -it  0  =  .50,  but  with  most  of  ihe  probability  weighted  at  the  extreme  values  of  G. 


Here  we  will  use  a  Beta  function  with  a  =  P  =  12.  The  probability  density  luncticn 
of  a  Beta  (1  2,1  2)  distribution  has  the  form; 


ni) 

f(0)  =  - (0r*  ^ ,  1-0)'^  “ , 

ni  2)r(i  2) 


f(0)  =  (1  JthOr^  -(1  -  0)‘'  “ . 
where  r(l)  Til  2)r(l  2)  =  1  (v 

When  this  conjectured  a  priori  probability  distribution  is  used  in  Bayes  theorem  we  get: 

(1  7rK0)'^  2(1-0)'^  -(0)Ml-0)‘‘^ 

f(0^  =  i)  =  - . 

Jed  7t)(0)-i  -(1-0)'*  -(e/(i.e)i-i  d0 

If  the  observation  of  Xj^  =  0,  then,  after  cancelling  the  common  factor  of  1  Tt.  the  a 
posteriori  distribution  becomes 

(0)-*  -(l-0)‘  - 

f(0:x^  =  oi=  - . 

Je  (0r‘  Mi-0)‘  -d0 

where  the  denominator  is 

rd2)r(3  2)  r(2). 

which  equals 


‘n 


The  a  posteriori  distribution  for  Xj^  =  0  becomes 


f  (0  I  Xjj  =  0)  =  (2  7t)(8r*'-  (l-e)''2  6 


ure  6.3  Beta  i  1  2,  1  2)  Case  -  A  Priori  and  A  Posterion  Distributions. 
In  the  instance  where  the  observation  yields  =  1.0.  Bayes  formula  gives 

(T  TT  M0)*  -  I  1-0  ' 

■  x„  -  10)  - - — . 

il  ::h0.'  -  .  l-e.''  -  je 


and  after  cancelling  the  common  factor  of  1  n.  the  a  posicnon  distribution  becom.es 


where  the  denominator  is 


r(3  2)r(I  2) 


7t  2. 


The  a  posteriori  distribution  for  =  1  becomes 


f(0  I  =  l.O)  =  (2  Ttxe)*  ^(1-9) 


I  2n.firr2 


(eqn  6.15) 


The  effect  that  Bayes  Theorem  has  on  the  a  priori  distribution  can  be  seen  in  Figure 
6.3  where  all  three  (one  a  priori  and  two  a  posteriori)  distributions  are  plotted. 

1.  Beta  (1/2. 1/2)  Case  Decision  Rule  with  ==  0 

The  decision  after  the  observation  of  Xj^  =  0  is  to  choose  alternative  if 

(2  mj’0:e(l-a)^aj{(0rl  2(1-0)*  2}  d9  >  (2.rt)j0{0(l  +  b)-l}((0)-*  2(1.0)!  2,.  ^0. 

which  simplifies  to  choosing  P,  if 


a  >  b  3  -  4  3, 


(eqn  6.16) 


or  P^  if 


a  <  b  3  -  4  3 


(eqn  6.17) 


This  result  can  be  seen  bv  notins  that  the  intcaral  on  the  left  is 


2i  1-a)  Kj'0  0*  2(  1.0)1  2^0  _  2a  7tj0  0'*  2  (1-0)1  -  d0. 


vielding 


[2(l-a)  :t]  [r(3  2)r{3  2)  r(3')]  ^  [2a  rt)  [HI  2)ri3  2)  r(2)], 


3a  4  1  4. 


The  integral  on  the  right  in  our  original  expression  is 


2(1  -^b)  (0)*  “(1-0)^  ^  d0  -  (2  tdJq  (0)'^  "^(1-0)*  “  d0 


-1  2m  0,1  2 


[2(1  ^h)  rt|  [r(3  2)r(3  2)  r(3)]  -  [2.JT1  [HI  2)r(3  2)  r(2)|, 


b  4  .  3  4. 


These  two  values  reduce  to  the  decision  rules  of  Equations  6.16  and  6.1"  to  choose 
alternative  P^.  if  a  >  b  3  -  4  3  or  if  a  <  b  3  -  4  3. 


2.  Beta  (1/2. 1/2)  Decision  Rule  with  Xj^  =  1.0 


The  decision  after  observation  of  Xj^  =  1.0  is  to  choose  alternative  P^,  if 
!2T:  ij0  :0(T-a)-al((0)^  2(l-0)-^  d0  >  '  2  7t)j0  {0{1  ^  bM)  {(0)*  2(1-0)'*  21’  je 


■  hich  simplifies  to  choosing  P^  if 


a  >  3b  -  2, 


(eqn  6. IS) 


i 


-  tTm.  r  ■ 


or  Ph  if 


a  <  .'b  -  2. 


feqn  6. 19 1 


This  result  can  be  seen  by  comparing  the  value  of  the  integral  on  the  left  to  the  one  on 
the  right,  where  the  integral  on  the  left  is 

{2(l-a)  7t;j0  \0)-^  '  (1-6)'^  “  d6  -  (2a  jdJq  (0)*  "  (1-0)'*  “  d0  . 

which  esaluates  to 

;2.  1-a)  rt;  ;r(5  2)r(I  2)  HaT  -  :2a  n}  (Ha  2)r(l  2)  r(2)). 


a  4  -r  14, 


Tile  integral  on  the  right  in  our  original  expression  above  is 


:<  1  -b.  ;rij0  0-’  -il-0r*  "  d0  -  (2  0'  -{l-0r*  -  d0 


winch  reduces  to 


[2ti-b)  It;  [r(5  2ini  2)  r(3)i  - 12  7t|[r(3  2ir(i  21  r(2ii. 


3b  4  -  1  4. 


The<e  two  values  result  in  the  decision  rules  of  Equations  6.18  and  6.19  to  choose  if 
a  >  3b  -  2  or  if  a  <  3b  -  2. 

Depending  on  his  subjective  feel  for  the  antagonists  intentions,  or  any 
historical  data  that  provide  a  reasonable  fit.  the  decision  maker  could  choose  of  one  of 
■he  three  continuous  a  prior:  probability  distributions  presented  m  this  chapter.  The  a 


prion  distribution  selected  and  the  subsequent  behavior  observed  will  determine  which 
decision  rule  he  will  follow.  .A  summarv'  of  the  results  of  this  chapter  are  presented  in 
■Appendix  A.  In  Chapter  \n  we  v.ill  show  some  specific  examples  for  the  cases 
presented  here  and  in  Chapter  \  .  Chapter  \  III  will  contain  concludine  remarks. 


VII.  SPECIFIC  EXAMPLES 


Ir.  this  chapter  we  will  look  at  specilic  examples  of  the  conjectured  a  rnon 
prohahihtjc  distributions  discussed  in  Chapters  V  and  VI.  There  will  be  four  cases  for 
tl'.e  probability  distributions  icne  discrete  and  three  continuous!  with  eieht 

examples.  The  discrete  case  will  use  four  proposed  values  of  6j..  N\'e  will  see  how  tire 
behaMcr  obseined,  through  the  use  of  Baxes  Theorem.  elTects  the  alternative  ahwh 
rr.aximizes  tiie  e.xpected  utility  value.  The  three  continuous  case  exam.ples  show  h.cw 
an  observation  of  each  type  of  behavior  elTects  the  decision  rule  used  and  ultiinateiv 
the  alternative  i  P,,  or  Pi.^!  chosen.  The  examples  used  in  this  chapter  will  assum.e  the 
l  e'iio  vving  preference  ordering  ofi 

V,P^,.A,)  >  ViP^.A,)  >  V,Pj^.Ah)  >  V,P^„Aj,) 
cr 

-  l.()  >  b  >  -1.0  >  a. 

We  also  assum.e  in  this  chapter  that  the  utility  values  chosen  for  a  =  ■  and  for 
1)  =  -  0,5, 

A.  DISCRETE  CASE  WITH  AGGRESSIVE  BEHAVIOR  OBSERVED 

\\'e  consider  th.e  instance  of  the  followng  a  pnon  probability  distribution! 

Pr(0  =  9  lb)  =  ( 1  8)  =  .125 
Pn0  =  3  4  )  =  (Mi  =  .25 
Pr(0  =  1  4  )  =  (  I  4)  =  .25 

Pr(Q  =  !  KM  =  t.e  8i  =  .3 "5. 

Now.  we  suppose  tiiat  the  protagonist  observes  aggressive  behavior  =  iM  on  the 

part  ol'  the  antagonist.  Baves  Theorem  gives  us  for  each  of  the  possible  values  of  0^. 


Pn 0  =  M  lo  Xj.j  =  b> 


;(1  lOHl.S)}  1(1  lOKl  S)-(l  4)(1  4)-(3  4)(1  4)-(9  10){3  S)’  =  (1  48)  =  .'i2. 


Sirrularlv, 


(I  4)(1  4) 


Pr(0=  3  4  X„  =  0)  = 


=  (5  48)  =  .10. 


(4S  SO) 


(3  4  )(1  4  ) 


Pr(0=  1  4  X^  =  0)  = 


=  (la  48)  =  .31. 


^48  SO) 


(9  10  H  3  S) 


Pr(0=  1  10  Xj^  =  0)  = 


=  (2'  48)  =  .56. 


(48  SO) 


The  course  of  acuon  to  choose  in  accordance  wuh  5.2  is  the  maximum  of 


V,  ail  -e^)Pr(0  =  Bj.,  X^  =  i  ). 


^^(be.  -  1)  Pr(0  =  0^  Xj^  =  1  ), 


the  maximum  of 


<-2.0);(l  lOKl  4S)-(1  4)(>  4S)*(3  4)(15  4S)  +  (9  10)(23  48)}  =  -1.5' 


cr 

1  -IS)-  4i-l!(5  4S)-  :(-.5)(  1  4)-l|(  15  48)-  :(-.5'(  1  10)-1  :(2~  4S 

'■vhich  equals  -  1.12.  The  protagonist  m  this  case  would  choose  P^,  s;nce  -1.52  is  les' 
titan  -1.12. 


B.  DISCRETE  CASE  WITH  PEACEFUL  BEHAVIOR  OBSERVED 

Ill's  section  uses  the  same  conjectured  probability  distribution  and  utility  -.alues 
siiown  in  tite  beginning  of  this  chapter  e.xcept  with  an  observation  of  peacefui 
’'chaMcr.  (X^.^  =  1.0).  Bayes  Theorem  gives  us  for  each  of  the  possible  values  of  0^ 

rl- 


Pn0=  9  10  ■X^=  I.O) 
n 


equals 


111  S)(9  10):  /  ,:il  8n9  10)-(]  4,m3  4)-(1  4)(T  4)  +  (3  8j(1  10)}  =  (9  32)  =  .28. 


Sinulariv. 


anu 


(I  4)(3  4) 

Pr(0=  3  4  Xj^=  1.0)  =  -  =  115  32)  =  .4 

(32  SO) 


a 


(1  4  M  1  4 1 

PnO-  14  Xj,=  l.(»)  =  -  =  (5  32)  =  16, 

■  ,'2  '<111 


I  1  I"  i<  1  Si 


1 


We  look  for  the  maximum  of 


(-2.0){(1,  10)(9,  32)  +  (l;4)(15  32)  +  (3  41(5.  32)  + (9 ’  10)(3  32)}  =  -  ."0 


or 

[(-.5k9  I0)-1}(9  32)^((-.5)(3  4)-l}(  15  32)^((-.5)(l  4).I}(5  32)+;(-.5)(l  10)-1)(3  32)  = 


Given  the  above  proposed  a  priori  probability  distribution  and  an  observation  of 
Xn  =  1.0,  the  protagonist  would  choose  alternative  P^,,  since  -.70  is  greater  than 

-1.33, 


C.  UNIFORM  CASE  WITH  AGGRESSIVE  BEHAVIOR  OBSERVED 

In  the  uniform  case  if  the  protagonist  observes  aggressive  behavior  then  the 
decision  rule  (Equation  6.4)  is  to  choose  if 

a  >  (I  2)b  -  (3  2). 

Since  a  =  -  2.0  and  b  -  •  .50,  the  the  protagonist  would  choose  P^. 

D.  UNIFORM  CASE  WITH  PEACEFUL  BEHAVIOR  OBSERVED 

The  uniform  case  with  an  obsersation  of  peaceful  behavior  =  1.0)  uses  the 
decision  rule  derived  in  Chapter  VI  (Equation  6.6).  This  equation  states  to  choose  P^  if 

a  >  2b  -  3, 

The  protagonist  would  choose  P^  as  his  choice  of  action. 


>4 


E.  BETA  (2.2)  CASE  WITH  AGGRESSIVE  BEHAVIOR  OBSERVED 

In  the  case  of  a  Beta  (2,2)  as  the  proposed  a  priori  probability  distribution  and 
an  observation  ofaggressive  behavior  (Xj^  =  0)  the  decision  rule  ^Equation  6.1U)  is  to 
choose  P ,  if 


a  >  (2  3)b  -  (5  3). 

The  prctasonist  would  find  that  he  is  indifTerent  betv/een  alternatives  and  Pu. 

F.  BETA  (2.2)  CASE  WITH  PEACEFL'L  BEHAVIOR  OBSERVED 

In  the  Beta  (2.2)  case  with  an  observation  of  peaceful  behavior  the  decision  rule 
(Equation  6.12)  is  to  choose  P^  if 


a  >  (3  2)b  -  (5  2), 

In  tl'iis  case,  substituting  in  the  values  for  a  and  b  will  lead  the  protagonist  to  select  P^. 


G.  BETA  (1/2. 1/2)  CASE  WITH  AGGRESSIVE  BEHAVIOR  OBSERVED 

A  Beta  (I  2,1  2)  with  =  0  as  the  observation  yields  Equation  6.1b  as  the 
decision  rule.  This  decision  rule  states  to  choose  P^  if 

a  >  I  1  3)b  -  (4  3), 

loading  the  protagonist  to  select  P^  as  his  course  of  action. 


H.  BETA  (1/2, 1/2)  C  ASE  WITH  PEACEFUL  BEHAVIOR  OBSERVED 

Using  a  Beta  (1  2.1  2)  as  the  proposed  a  priori  probability  distribution,  with  an 
observation  of  =  1.0.  gives  us  the  decision  rule  of  Equation  6. IS.  The  wilues  of  a 
and  and  tlie  rule. 


a  >  3b  -  2. 


lead  the  protagonist  to  select  alternative  P^. 

The  reader  is  reminded  that  while  these  specific  examples  appear  to  be  definitive 
in  their  conclusions  as  to  which  course  of  action  a  protagonist  should  select,  they  are 
based  on  subjective  assertions  or  historic  data  as  to  which  probability  distribution  best 
describes  the  proportion  of  actions,  by  the  antagonist,  that  are  peaceful.  These 
Jcasion  rules  are  o.nJi'  as  sood  as  the  distributions  thcv  are  based  on. 


VIII.  SUMMARY  AND  SUGGESTIONS  FOR  FURTHER  STUDY 


This  thesis  has  proposed  a  simple  2  \  2  decision  under  risk  matrix  as  a  mode!  for 
deci''ion  makinc  in  an  uncertain  environment.  This  matrix  is  referred  to  as  a  con'lict 
■situation  matrix.  Tire  2x2  matrix  design  uses  the  two  columns  as  two  unknown  states 
cf  nature  and  the  two  rows  as  the  decision  alternatives  for  a  protagonist.  Th.e  random 
>tates  of  nature  are  interpreted  as  being  either  the  occurrence  or  r.on-occurrcnce  ol' 
peaceful  behavior  on  the  part  of  an  antagonist.  The  probability  of  occurrence  of  either 
state  of  nature  is  estimated  through  the  use  of  Bayes  Theorem. 

Utility  \alues.  .’•ated  on  an  interval  scale,  are  assigned  to  the  four  cells  of  the 
matrix.  These  utility  values  are  used  as  payoffs  in  the  conflict  situation  matrix.  The 
estimated  probabilities  of  occurrence  of  peaceful  behavior  of  the  antagonist  are  used  to 
develop  a  Bayesian  decision  rule  to  maximize  expected  utility  value  for  the  protagonist. 
The  Bayesian  decision  rules  used  are  relative  to  the  a  posteriori  probability  distributions 
whereas  tlie  initial  a  priori  probability  distribution  estimates  are  subjective  or  based  on 
historic  data.  The  Bayesian  decision  rules  developed  use  discrete  and  continuous  a 
priori  probability  distributions.  The  decision  rule  for  the  discrete  case  is  to  choose  tire 
alternati\e  or  that  maximizes 

Xr  E  '  P,.er)  Pr'O  =  0^  !  =  i  ). 

'.vhere  ;  =  c  or  h  and  r  =  1,2....  This  decision  rule  and  its  continuous  case  analogue 
va;.  s  to  select  the  alternative  that  maximizes  expected  utility  relative  to  the  a  posteriori 
Ba>es;an  estimates.  The  three  continuous  case  a  priori  probability  distributions 
aor.'idered  arc  the  uniform,  the  Beta  (2,2)  and  the  Beta  (1  2,1  2).  The  general  form  of 
tne  de.i'iion  rule  for  these  cases  is  to  select  tiic  alternative  P^,  or  Pj^  that  maximazes 

fT.p,.8,f  e  x„  =  i)de, 

j  j  n 

■viicrc  :  =  c  or  h  and  9.  is  the  ccntmucus  u  rnori  probability  distribution  being 


The  model  proposed  here  provides  reasonable  decision  rules  that  are  not  counter¬ 
intuitive.  The  idea  of  categorizing  all  protagonist  and  antagonist  behaviors  (or  actions  i 
as  either  peaceful  or  aggressive  when  incorporated  with  the  Bayesian  decision  rules 
work  well,  and  make  for  a  usable,  easily  understood  model.  This  model  and  its 
decision  rules  may  also  provide  a  building  block  for  larger  models.  .An  area  that  could 
proside  room  for  expansion  might  include  a  larger  conflict  situation  matrix.  If  we 
allowed  the  pronmomst  and  the  antagonist  additional  alternatives,  (e.g.,  no  action 
taken,  as  suggested  in  in  the  fifth  chapter),  then  we  would  open  up  the  possibility  for 
an  n  \  m  conrlict  situation  matrix. 

.Another  additional  feature  that  wasn't  considered  here  would  be  to  let  the 
decision  maker  rate  all  the  outcomes  in  the  matrix  on  an  interval  scale.  This  would 
surely  aifect  the  decision  rules  with  the  addition  of  two  more  variables  and  would  allow 
all  combinations  of  preference  patterns  to  be  considered. 

N'ariance  arguments  for  the  a  priori  Beta  probability  functions,  with  a  equal  to  p, 
could  also  be  incorporated.  The  mean  of  these  density  functions  is  .50  as  ^  =  a  a-p. 
The  variance  of  a  Beta  distribution  is  given  by  (T“  =  ap  (a  p)^  (a  -  P  -  1 ).  This 
fact  could  he  used  for  the  a  =  P  >  1.0  case,  that  increasing  the  values  of  a  and  P, 
while  maintaining  their  equality,  decreases  the  variance  of  the  parameter  and  leaves  the 
mean  unchanged.  This  causes  the  distribution  to  weight  more  and  more  probability 
about  tlte  mean  and  less  and  less  at  the  extremes.  This  characteristic  of  the  Beta 
function  could  be  exploited  to  reflect  a  degree  of  confidence  by  the  decision  maker  in 
his  a  priori  assumption  of  a  Beta  distribution  with  a  =  P  >  1.  In  the  a  =  p  1.') 
ca<e  -.ve  would  have,  as  a  =  p  -»  -r-  l.O.  the  <T*  -»  0.  and  this  could  also  be  used  to 
reflect  the  decision  makers  confidence  in  his  a  pnon  assumption  of  a  Beta  distribution 

a  =  P  <1.0.  This  particular  characteristic  of  Beta  distributions  with  a  =  P 
wouid  provide  an  area  for  further  study.  Other  continuous  a  prion  probabilitv 
dl^tr;bt.tlons,  and  particularly  non-symmetric  Beta  distributions  (a*P).  should  also 
provide  interesting  further  research. 

The  purpose  of  this  model  is  to  analyze  the  probable  future  consequer.i.c’s  of 
de».!sicn  choices.  The  decision  criteria  used  in  the  mode!  proposed  is  highest  expected 
utiiity  value.  It  is  conceded  that  a  simple  model,  such  as  this,  and  its  suggested 
dcci-'ijn  criteria  cannot  capture  all  the  raaufications  and  consequences  of  pclitnal 
decis;or.>  that  befall  a  decision  maker  at  the  national  level.  While  it  may  be  true  'liat 
n.odels  cannot  substitute  for  real.t'.  and  anahsts  cannot  substitute  for  desiNivii 
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APPENDIX  A 

SUMMARY  OF  RESULTS  FOR  CONTINUOUS  CASES 


RESULTS  WITH 
UNIFORM  A  PRIORI 

A  PRIORI  PROBABILITY  DISTRIBL  TIOS 
fiO)  =  1 

A  POSTERIORI  for  action  =  i 


•'<n=  ‘ 

aggressi\-e) 

(peaceful) 

2  -  20 

20 

Choose  if 

a  >  1  :b  -  3  2  a  >  :b  -  3 

Choose  Pj^  if 


RESULTS  WITH 
beta  (2.2)  A  PRIORI 


.!  PRIORI  PROBABILITY  DISTRIBI  TIOS 
f ,0)  =  [r(4)  r(2)  r(2i| iSh i-Si 

A  POSTERIORI  for  action  =  i 

=  '->  =  .1 
[aggressive)  {peaceful) 

i2(0)(i-e)“  i:(0)-(i-0) 

Choose  Pj,  if 

a  >  2  3b  -  5  3  a  >  3  2b  -  5  2 

Choose  Pj^  if 


a  <  2  3b  -  5  3 


a  <  3  2b  -  5  2 


RESULTS  WITH 
BETA  (1/2, 1/2)  A  PRIORI 


.-(  PRIORI  PROBABILITY  DISTRIBl'TIOS 

t'0)  =  ir(i,‘  Ell  2)r{i  2)!(0)‘*  -(1-0)'^  - 


A  POSTERIORI  for  action  Af  =  i 


X 


n 


0 


( aggrcssr.'e) 


> 

(peaceful) 


7r(0t'*  “(1-0)’ 


2  ;r(0)’  -(1-0)'’  - 


Choose  if 

a  >  1  3b  -  4  3  a  >  3b  -  2 


Choose  if 


a  <  1  3b  -  4  3  a  <  3b  -  2 
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